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AMENDMENT TO THE CLAIMS 

1. (Currently Amended) A method of building a compressed speech lexicon for use in a speech 
application, comprising: 

receiving a word list and word d e p e nd e nt data; configured for use in the speech 
application, the word list including a plurality of words, as sociat e d with each 
word in the word list having associated word-dependent data selected from the 
group consisting of a pronunciation and part-of-speech; 

selecting a-one of the words from the word list; 

generating an index entry identifying a location in a compressed speech lexicon memory 

for holding the selected word; 
encoding the selected word and its associated word-dependent data to obtain encoded 

words and associated encoded word-dependent data; and 
writing the encoded word and its associated word-dependent data at the identified 

location in the speech lexicon memory. 

2. (Original) The method of claim 1 and further comprising: 

repeating the steps of selecting, generating, encoding and writing lor each word in the 
word list and the associated word-dependent data. 

3. (Previously Presented) The method of claim 2 and further comprising: 

writing codebooks corresponding to the encoded words and the encoded word-dependent 
data in the speech lexicon memory. 

4. (Previously Presented) The method of claim 1 wherein receiving the word list comprises: 

counting the words in the word list; 

allocating a hash table memory based on a number of words in the word list; and 
allocating a speech lexicon memory based on the number of words in the word list. 
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5. (Previously Presented) The method of claim 1 wherein generating an index entry comprises: 

determining a next available location in the speech lexicon memory. 

6. (Previously Presented) The method of claim 5 wherein generating an index entry comprises: 

calculating a hash value for the selected word; 

indexing into the hash table to an index location based on the hash value; and 
writing location data identifying the next available location in the speech lexicon memory 
into the index location in the hash table. 

7. (Previously Presented) The method of claim 6 wherein writing location data comprises: 

writing an offset into the speech lexicon memory that corresponds to the next available 
location in the speech lexicon memory. 

8. (Original) The method of claim 1 wherein encoding comprises: 

providing a word encoder to encode the words in the word list and encoding the words 

with the word encoder; and 
providing word-dependent data encoders for each type of word-dependent data in the 

word list and encoding the word-dependent data with the word-dependent data 

encoders. 

9. (Original) The method of claim 8 wherein encoding further comprises: 

Hufmann encoding the selected word and its associated word-dependent data. 

10. (Original) The method of claim 1 wherein writing the encoded word and word-dependent 
data comprises: 

writing a data structure comprising: 

a word portion containing the encoded word; 

a word-dependent data portion containing the encoded word-dependent data; and 



wherein each word-dependent data portion has an associated last indicator portion 
and word-dependent data indicator portion, the last indicator portion 
containing an indication of a last portion of word-dependent data 
associated with the selected word, and the word-dependent data indicator 
portion containing an indication of the type of word-dependent data stored 
in the associated word dependent data portion. 

11. (Original) The method of claim 10 wherein writing a data structure comprises writing the 
word portion and the word-dependent data portions as variable length portions followed by a 
separator. 

12. (Currently Amended) A method of accessing word information related to a word stored in a 
compressed speech lexicon, comprising: 

receiving the word; 

accessing an index to obtain a word location in the compressed speech lexicon that 
contains information associated with the received word including word-dependent 
data selected from the group consisting of a pronunciation and a part-of-speech; 

reading encoded word information from the word location; and 

decoding the word information for use in a speech application. 

13. (Original) The method of claim 12 and further comprising: 

prior to reading the encoded word information, reading an encoded word from the word 

location; 
decoding the encoded word; and 

verifying that the decoded word is the same as the received word. 

14. (Original) The method of claim 12 wherein reading the encoded word infoimation comprises: 

reading a plurality of fields from the word location containing variable length word 
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information. 

15. (Original) The method of claim 14 wherein reading a plurality of fields comprises: 

prior to reading each field, reading data type header information indicating a type of word 
information in an associated field. 

16. (Original) The method of claim 15 wherein reading a plurality of fields comprises: 

reading a last field indicator indicating whether an associated one of the plurality of fields 
is a last field associated with the received word. 

17. (Original) The method of claim 12 wherein decoding the word information comprises: 

initializing decoders associated with the word and its associated information. 

18. (Original) The method of claim 12 wherein accessing an index comprises: 

calculating a hash value based on the received word; 
finding an index location in the index based on the hash value; and 
reading from the index location a pointer value pointing to the word location in the 
compressed lexicon. 

19. (Currently amended) A compressed speech lexicon builder for building a compressed speech 
lexicon for use in a speech application based on a word list containing a plurality of domains, the 
domains including words and word-dependent data associated with each of the words, the 
compressed speech lexicon builder comprising: 

a plurality of domain encoders, one domain encoder being associated wi th each domain in 
the word list, the domain encoders being configured to compress the words an d the 
associated word-dependent dat a selected from the group consisting of a 
pronunciation and a part-of-speech, to obtain compressed words and compressed 
word-dependent data; 
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a hashing component configured to generate a hash value for each word in the word list; 

a hash table generator, coupled to the hashing component, configured to determine a next 
available location in a speech lexicon memory and write, at an address in a hash 
table identified by the hash value, the next available location in the speech lexicon 
memory; and 

a speech lexicon memory generator, coupled to the domain encoders and the hash table 
generator, configured to store in the speech lexicon memory, for use by the speech 
application, the compressed words and compressed word-dependent data, each 
compressed word and its associated compressed word-dependent data being stored 
at the next available location in the speech lexicon memory written in the hash table 
at the hash table address associated with the compressed word. 

20. Canceled. 

21. Canceled. 

22. (Original) The compressed speech lexicon builder of claim 19 and fiirther comprising: 

a codebook generator generating a codebook associated with each domain encoder. 

23. Canceled. 

24. Canceled. 

25. Canceled. 

26. Canceled. 

27. Canceled. 

28. Canceled. 

29. Canceled. 

30. Canceled. 

3 1 . Canceled. 



